智能论文笔记

Physical Pooling Functions in Graph Neural Networks for Molecular Property Prediction

Artur M. Schweidtmann , Jan G. Rittig , Jana M. Weber , Martin Grohe , Manuel Dahmen , Kai Leonhard , Alexander Mitsos

分类：机器学习

2022-07-27

图形神经网络（GNN）正在化学工程中出现，以基于分子图的物理化学特性端到端学习。 GNNS的一个关键要素是合并函数，将原子矢量结合到分子指纹中。大多数以前的作品都使用标准池功能来预测各种属性。但是，不合适的合并功能会导致概括不佳的非物理GNN。我们根据有关学习特性的物理知识比较并选择有意义的GNN合并方法。通过量子机械计算计算出的分子特性证明了物理池函数的影响。我们还将结果与最近的SET2Set合并方法进行了比较。我们建议使用总和池来预测取决于分子大小的性能并比较分子大小无关的属性的池函数。总体而言，我们表明物理池功能的使用显着增强了概括。

translated by 谷歌翻译

Graph Neural Networks for Temperature-Dependent Activity Coefficient Prediction of Solutes in Ionic Liquids

Jan G. Rittig , Karim Ben Hicham , Artur M. Schweidtmann , Manuel Dahmen , Alexander Mitsos

分类：机器学习

2022-06-23

离子液体（ILS）是可持续过程的重要溶剂，并且需要预测IL中溶质的活性系数（AC）。最近，矩阵完成方法（MCM），变压器和图神经网络（GNN）在预测二元混合物的AC方面表现出很高的精度，例如宇宙RS和UNIFAC优于公认的模型。 GNN在这里特别有希望，因为他们学习了分子图到特性的关系，而无需预处理，通常是变压器所需的，并且与MCMS不同，适用于不包括训练中不包括的分子。但是，对于ILS，目前缺少GNN应用程序。在此，我们提出了一个GNN，以预测IL中溶质的温度依赖性无限稀释液。我们在包括40,000多个AC值的数据库上训练GNN，并将其与最先进的MCM进行比较。 GNN和MCM实现了类似的高预测性能，GNN还可以对培训期间未考虑的IL和溶质的AC进行高质量的预测。

translated by 谷歌翻译

Information-Theoretic Safe Exploration with Gaussian Processes

Alessandro G. Bottero , Carlos E. Luis , Julia Vinogradska , Felix Berkenkamp , Jan Peters

分类：机器学习

2022-12-09

We consider a sequential decision making task where we are not allowed to evaluate parameters that violate an a priori unknown (safety) constraint. A common approach is to place a Gaussian process prior on the unknown constraint and allow evaluations only in regions that are safe with high probability. Most current methods rely on a discretization of the domain and cannot be directly extended to the continuous case. Moreover, the way in which they exploit regularity assumptions about the constraint introduces an additional critical hyperparameter. In this paper, we propose an information-theoretic safe exploration criterion that directly exploits the GP posterior to identify the most informative safe parameters to evaluate. Our approach is naturally applicable to continuous domains and does not require additional hyperparameters. We theoretically analyze the method and show that we do not violate the safety constraint with high probability and that we explore by learning about the constraint up to arbitrary precision. Empirical evaluations demonstrate improved data-efficiency and scalability.

translated by 谷歌翻译

Temporally-Consistent Surface Reconstruction using Metrically-Consistent Atlases

Jan Bednarik , Noam Aigerman , Vladimir G. Kim , Siddhartha Chaudhuri , Shaifali Parashar , Mathieu Salzmann , Pascal Fua

分类：计算机视觉

2021-11-12

我们提出了一种从一系列时间演化点云序列中对时间一致的表面序列的无监督重建的方法。它在帧之间产生了密集和语义有意义的对应关系。我们将重建的表面代表由神经网络计算的Atlases，这使我们能够在帧之间建立对应关系。使这些对应关系的关键是语义上有意义的是为了保证在相应点计算的度量张量和尽可能相似。我们设计了一种优化策略，使我们的方法能够强大地对噪声和全局动作，而无需先验的对应关系或预先对准步骤。结果，我们的方法在几个具有挑战性的数据集中占据了最先进的。该代码可在https://github.com/bednarikjan/temporally_coherent_surface_reconstruction附近获得。

translated by 谷歌翻译

Robust deep learning-based semantic organ segmentation in hyperspectral images

Silvia Seidlitz , Jan Sellner , Jan Odenthal , Berkin Özdemir , Alexander Studier-Fischer , Samuel Knödler , Leonardo Ayala , Tim Adler , Hannes G. Kenngott , Minu Tizabi

分类：计算机视觉 | 机器学习

2021-11-09

语义图像分割是手术中的背景知识和自治机器人的重要前提。本领域的状态专注于在微创手术期间获得的传统RGB视频数据，但基于光谱成像数据的全景语义分割并在开放手术期间获得几乎没有注意到日期。为了解决文献中的这种差距，我们正在研究基于在开放手术环境中获得的猪的高光谱成像（HSI）数据的以下研究问题：（1）基于神经网络的HSI数据的充分表示是完全自动化的器官分割，尤其是关于数据的空间粒度（像素与Superpixels与Patches与完整图像）的空间粒度？（2）在执行语义器官分割时，是否有利用HSI数据使用HSI数据，即RGB数据和处理的HSI数据（例如氧合等组织参数）？根据基于20猪的506个HSI图像的全面验证研究，共注释了19个类，基于深度的学习的分割性能 - 贯穿模态 - 与输入数据的空间上下文一致。未处理的HSI数据提供优于RGB数据或来自摄像机提供商的处理数据，其中优势随着输入到神经网络的输入的尺寸而增加。最大性能（应用于整个图像的HSI）产生了0.89（标准偏差（SD）0.04）的平均骰子相似度系数（DSC），其在帧间间变异性（DSC为0.89（SD 0.07）的范围内。我们得出结论，HSI可以成为全自动手术场景理解的强大的图像模型，其具有传统成像的许多优点，包括恢复额外功能组织信息的能力。

translated by 谷歌翻译

ArchABM: an agent-based simulator of human interaction with the built environment. $CO_2$ and viral load analysis for indoor air quality

Iñigo Martinez , Jan L. Bruse , Ane M. Florez-Tapia , Elisabeth Viles , Igor G. Olaizola

分类：人工智能

2021-11-02

最近的证据表明，SARS-COV-2是2020年导致全球大流行病的病毒，主要经由室内环境中的空气机气溶胶传播。在评估和控制建筑物的室内空气质量（IAQ）时，这需要新颖的策略。 IAQ通常可以通过通风和/或政策来控制以调节人建筑物相互作用。然而，在建筑物中，占用者使用其他方式使用房间，可能并不明显哪种措施或对措施的组合导致成本和能源有效的解决方案，确保整个建筑物的良好IAQ。因此，在本文中，我们介绍了一种基于代理的模拟器，亚拟合，旨在帮助通过估计足够的房间尺寸，通风参数和测试政策的效果来帮助创造新的或适应现有建筑物，同时考虑到IAQ的结果复杂的人建筑物相互作用模式。最近公开的气溶胶模型适于计算每个房间中的时间依赖性二氧化碳（$ CO_2 $）和病毒量子浓度，每天吸入$ CO_2 $和病毒量子，作为生理反应的衡量标准。由于其模块化架构，Archabm对气溶胶模型和建筑布局具有灵活性，这允许实现进一步的模型，任何数字和房间，代理和操作的行动，反映人建筑物交互模式。我们提供了一个基于我们研究中心采用的真正平面计划和工作时间表的用例。本研究表明，先进的仿真工具如何有助于改善建筑物的IAQ，从而确保健康的室内环境。

translated by 谷歌翻译

Reproducible radiomics through automated machine learning validated on twelve clinical applications

Martijn P. A. Starmans , Sebastian R. van der Voort , Thomas Phil , Milea J. M. Timbergen , Melissa Vos , Guillaume A. Padmos , Wouter Kessels , David Hanff , Dirk J. Grunhagen , Cornelis Verhoef

分类：计算机视觉

2021-08-19

放射线学使用定量医学成像特征来预测临床结果。目前，在新的临床应用中，必须通过启发式试验和纠正过程手动完成各种可用选项的最佳放射组方法。在这项研究中，我们提出了一个框架，以自动优化每个应用程序的放射线工作流程的构建。为此，我们将放射线学作为模块化工作流程，并为每个组件包含大量的常见算法。为了优化每个应用程序的工作流程，我们使用随机搜索和结合使用自动化机器学习。我们在十二个不同的临床应用中评估我们的方法，从而在曲线下导致以下区域：1）脂肪肉瘤（0.83）； 2）脱粘型纤维瘤病（0.82）; 3）原发性肝肿瘤（0.80）; 4）胃肠道肿瘤（0.77）； 5）结直肠肝转移（0.61）; 6）黑色素瘤转移（0.45）; 7）肝细胞癌（0.75）; 8）肠系膜纤维化（0.80）; 9）前列腺癌（0.72）； 10）神经胶质瘤（0.71）; 11）阿尔茨海默氏病（0.87）;和12）头颈癌（0.84）。我们表明，我们的框架具有比较人类专家的竞争性能，优于放射线基线，并且表现相似或优于贝叶斯优化和更高级的合奏方法。最后，我们的方法完全自动优化了放射线工作流的构建，从而简化了在新应用程序中对放射线生物标志物的搜索。为了促进可重复性和未来的研究，我们公开发布了六个数据集，框架的软件实施以及重现这项研究的代码。

translated by 谷歌翻译

OpenML Benchmarking Suites

Bernd Bischl , Giuseppe Casalicchio , Matthias Feurer , Pieter Gijsbers , Frank Hutter , Michel Lang , Rafael G. Mantovani , Jan N. van Rijn , Joaquin Vanschoren

分类： (统计)机器学习 | 机器学习

2017-08-11

机器学习研究取决于客观解释，可比和可重复的算法基准。我们倡导使用策划，全面套房的机器学习任务，以标准化基准的设置，执行和报告。我们通过帮助创建和利用这些基准套件的软件工具来实现这一目标。这些无缝集成到OpenML平台中，并通过Python，Java和R. OpenML基准套件（A）的接口访问，易于使用标准化的数据格式，API和客户端库; （b）附带的数据集具有广泛的元信息; （c）允许在未来的研究中共享和重复使用基准。然后，我们为分类提供了一个仔细的策划和实用的基准测试套件：OpenML策划分类基准测试套件2018（OpenML-CC18）。最后，我们讨论了使用案例和应用程序，这些案例和应用程序尤其展示了OpenML基准套件和OpenML-CC18的有用性。

translated by 谷歌翻译

On the causality-preservation capabilities of generative modelling

Yves-Cédric Bauwelinckx , Jan Dhaene , Tim Verdonck , Milan van den Heuvel

分类：机器学习

2023-01-03

Modeling lies at the core of both the financial and the insurance industry for a wide variety of tasks. The rise and development of machine learning and deep learning models have created many opportunities to improve our modeling toolbox. Breakthroughs in these fields often come with the requirement of large amounts of data. Such large datasets are often not publicly available in finance and insurance, mainly due to privacy and ethics concerns. This lack of data is currently one of the main hurdles in developing better models. One possible option to alleviating this issue is generative modeling. Generative models are capable of simulating fake but realistic-looking data, also referred to as synthetic data, that can be shared more freely. Generative Adversarial Networks (GANs) is such a model that increases our capacity to fit very high-dimensional distributions of data. While research on GANs is an active topic in fields like computer vision, they have found limited adoption within the human sciences, like economics and insurance. Reason for this is that in these fields, most questions are inherently about identification of causal effects, while to this day neural networks, which are at the center of the GAN framework, focus mostly on high-dimensional correlations. In this paper we study the causal preservation capabilities of GANs and whether the produced synthetic data can reliably be used to answer causal questions. This is done by performing causal analyses on the synthetic data, produced by a GAN, with increasingly more lenient assumptions. We consider the cross-sectional case, the time series case and the case with a complete structural model. It is shown that in the simple cross-sectional scenario where correlation equals causation the GAN preserves causality, but that challenges arise for more advanced analyses.

translated by 谷歌翻译

Flexible Supervised Autonomy for Exploration in Subterranean Environments

Harel Biggie , Eugene R. Rush , Danny G. Riley , Shakeeb Ahmad , Michael T. Ohradzansky , Kyle Harlow , Michael J. Miles , Daniel Torres , Steve McGuire , Eric W. Frew

分类：机器人

2023-01-02

While the capabilities of autonomous systems have been steadily improving in recent years, these systems still struggle to rapidly explore previously unknown environments without the aid of GPS-assisted navigation. The DARPA Subterranean (SubT) Challenge aimed to fast track the development of autonomous exploration systems by evaluating their performance in real-world underground search-and-rescue scenarios. Subterranean environments present a plethora of challenges for robotic systems, such as limited communications, complex topology, visually-degraded sensing, and harsh terrain. The presented solution enables long-term autonomy with minimal human supervision by combining a powerful and independent single-agent autonomy stack, with higher level mission management operating over a flexible mesh network. The autonomy suite deployed on quadruped and wheeled robots was fully independent, freeing the human supervision to loosely supervise the mission and make high-impact strategic decisions. We also discuss lessons learned from fielding our system at the SubT Final Event, relating to vehicle versatility, system adaptability, and re-configurable communications.

translated by 谷歌翻译